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In this study we present a model for the integration of microsatellite genotyping with 
photographic identification of humpback whales, Megaptera novaeangfiae , using samples 
from the east coast of Australia as a case study. A suite of 10 microsatellite markers was 
selected for this study, based on recommendations made by ANZECC and discussions with 
other research groups. Seven of the 10 markers were successfully used to genotype 12 
sloughed skin samples from humpback whales on their northern migration along the east 
coast of Australia, resulting in 11 individual whales being identified. Two samples, collected 
from the same pod of whales, were found to be from one individual, as the genotypes of both 
samples were identical, while two further samples identified a pair of whales as a possible 
parent/offspring combination. In order to establish a worldwide database incorporating 
genetic and photographic identification of humpback whales, results must be standardised 
between research groups. To overcome potential technical difficulties of standardising 
results, we recommend that each research group sequence a reference sample or group of 
reference samples for each locus and that results are reported in repeat number rather than 
absolute PCR product size. □ MicrosateHite genotyping, humpback whale , Megaptera 
novaeangliae, photo identification. 
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Recent studies of humpback whales, Megaptera 
tiovaeangliae , have employed passive methods 
such as photographic identification of tail flukes 
and dorsal fins to examine site fidelity, basic 
social associations, migratory paths, population 
estimates and population growth (Isaacs & 
Dalton, 1992; Gill & Burke, 1999; Garrigue, 
2001). Although much knowledge has been 
derived from photo-ID studies, the technique can 
be inconsistent and subjective, and susceptible to 
human error (Corkeron et al., 1999). Further¬ 
more, young humpback whales can undergo 
extensive colour changes as they grow (Carlson 
& Mayo, 1990; Valsecchi & Amos, 1996), it is 
often difficult to approach animals due to 
behavioural responses and weather conditions, 
and individuals may lack distinguishing attributes 
necessary for unambiguous identification (Bain, 
1990; Stem et al., 1990; Valsecchi & Amos, 
1996). 

As a result the Australian and New Zealand 
Environment and Conservation Council 
(ANZECC) recommended that genetic analyses 
be integrated with conventional research methods, 
such as photographic identification, to address 
remaining issues concerning humpback whale 


populations. Microsatellite genotyping is a rapid, 
accurate and systematic technique, which can 
provide key insights into humpback whale 
ecology and evolution. While photo-ID does not 
lend itself readily to systematic profiling of 
individuals, it has the advantage of being a simple 
and obvious method of differentiating between 
individuals. A digital database incorporating both 
microsatellite genotyping and photo-ID of 
humpback whales would combine the advantages 
of each technique, providing information on pop¬ 
ulation sizes, more detailed social associations, 
sex identification, mating strategies, stock 
structure, gene flow and parentage. 

Southern Hemisphere humpback whales were 
classified into six stocks (Groups 1-VI) by the 
International Whaling Commission (IWC) based 
on their aggregations in Antarctic summer 
feeding grounds. Genetic analysis of Group IV, V 
and VI stocks are of particular interest to 
Australian humpback whale research. Discovery 
tagging and acoustic analysis of Group IV and V 
stocks indicate that mixing of these populations is 
likely to occur (Chittleborough, 1965; Paterson, 
1991), as is mixing of Group V and VI stocks 
(Valsecchi et al., 1997). Genetic differences 
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within and between these stocks remain unclear 
and movement patterns of individual whales 
across jurisdictional boundaries, within and 
between nations, need further investigation. 

This study utilises 10 microsatellite markers to 
genetically ‘fingerprint’ 12 humpback whale 
sloughed skin samples collected during the Cape 
Byron Whale Research Project 2000. The micro¬ 
satellite markers were selected as a standard set 
of genetic markers for humpback whale research 
in the Southern Hemisphere based on recom¬ 
mendations made by ANZECC (Corkeron et ah, 
1999) and discussion with other genetics 
laboratories in the Southern Hemisphere. Our aim 
was to establish a model for integrating micro¬ 
satellite genotyping with photo-ID of humpback 
whales migrating along the east coast of 
Australia. Such information would provide a 
basis for establishment of a Southern Hemisphere 
humpback whale database. 

METHODS 

SAMPLE COLLECTION. During the Cape 
Byron Whale Research Project 2000, 91 
sloughed skin samples were collected from 
humpback whales on their northern migration 
along the east coast of Australia. Where possible, 
accompanying photographs of the whale’s tail 
fluke and dorsal fins were taken when the skin 
was collected. Byron Bay was selected for this 
study due to the close proximity of whales to the 
mainland, which allowed both land- and 
sea-based surveys to be conducted. Twelve skin 
samples were selected for microsatellite geno¬ 
typing on the basis that each sample could be 
directly linked to an individual whale by being 
either the sole animal in a pod, or positively 
matched to a photo. DNA was extracted from 
approximately 1cm 2 of sloughed 
skin using the Tissue Protocol for 
the QIAamp DNA Mini Kit (Qiagen) 
according to the manufacturers 
instructions, with the exception that 
extracts were eluted with 2 X 1 OOjjlI 
of buffer AE, instead of the 2 x 
200 (xl recommended. 


LABORATORY ANALYSIS. Ten 
humpback whale microsatellite loci 
were selected for microsatellite 
genotyping; EV14, EV21, EV37, 
EV94, EV96 & EV104 (Valsecchi 
& Amos, 1996), and GATA28, 
GATA53, GATA417 & TAA31 
(Palsboll et al., 1997a). The forward 


primer of each locus was fluorescently labelled so 
that for each individual, PCR products could be 
combined for genotyping in two lanes of an 
automated sequencing gel without products over¬ 
lapping in colour or expected size range (Table 1). 

PCR amplifications were carried out separately 
for each individual/locus combination before PCR 
products were combined for gel separation. Locus 
EV94 was unable to be optimised and was not 
used for further analysis. PCR reaction mixtures 
contained: 1 X reaction buffer (Biotech), 0.1 mM 
of each dNTP, 0.1 p-M of each of the forward and 
reverse primers, 0.55 units Taq (Biotech), 2.5mM 
MgCl 2 ,4 pi of genomic DNA, and Milli-Q water 
to a total volume of 20pl. PCR reactions were 
performed on a PC960G thermal cycler (Corbett 
Research, Sydney) and run under the following 
conditions: 1 minute initial denaturation at 92°C, 
followed by 35 cycles of 10 seconds denaturation 
at 92°C, 30 seconds annealing at the optimised 
temperature (Table 1), and 1 minute extension at 
75°C followed by a final extension step of 75°C 
for 5 minutes. 

Genotyping of PCR products was conducted 
on an ABI Prism 310 genetic analyser (Applied 
Biosystems) using Genescan-500 TAMRA as an 
internal size standard. Results were displayed 
using Genescan software (Applied Biosystems). 
Genotypes were scored using Genotyper 
software (Applied Biosystems). Loci GATA417 
and TAA31 were unable to be genotyped for the 
majority of samples and were not used for further 
analysis. 

DATA ANALYSIS. In cases where two identical 
genotypes were found, the specific probability of 
identity (POI) for that exact genotype was 
calculated based on the POI formulae of Paetkau 
& Strobeck (1994). Due to the low sample size, 


TABLE 1. PCR amplification conditions, number of alleles and expected 
and observed allele size ranges for the 10 micro satellite loci. 


Locus 

Dye label 

Annealing 

Number of 

Expected size 

Observed size 

temp. (°C) 

alleles 

range (bp) 

range (bp) 

■ EV14 

FAM 

« 

;i 

125-145 

128-142 

EV21 

FAM 

48 

h « 

107-117 

107-119 

EV37 

TET 

50 

12 

190-228 

190-218 

EV94 

TET 

- 

- 

202-222 

- 

| EV96 

FAM 

48 

8 

185-213 

190-210 

EV104 

TET 

48 

3 

143-153 

141-145 

GATA28 

TET 

48 

5 

147-191 

144-178 

GATA53 

FAM 

48 

9 

178-210 

232-278 

GATA417 

HEX 

50 

. 

193-293 

_ 

TAA31 

TET 

48 

- 

85-121 

- 
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we could not accurately determine allele 
frequencies and therefore the POI calculations 
lacked precision, however, the error was small. 

RESULTS 

DNA from all 12 sloughed skin samples was 
amplified successfully at 7 loci (EV14, EV21, 
EV37, EV96, EV104, GATA28 and GATA53). 
Locus EV94 was unable to be optimised, while 
GATA417 and TAA31 were optimised success¬ 
fully for PCR however did not amplify for the 
majority of samples. All 7 loci that successfully 
amplified were found to be polymorphic, 
exhibiting between 3 and 12 alleles (Table 1). 
The level of allelic diversity detected was similar 
to that of other studies, despite the comparatively 
small sample size (Valsecchi & Amos, 1996; 
Palsboll et al., 1997a). The expected level of 
heterozygosity for each locus ranged between 
0.55 and 0.98. 

Alleles that were potentially unique to this 
study were detected at 5 loci, E V14, E V21, EV104, 
GATA28 and GATA53. All samples genotyped at 
locus GATA53 displayed a marked difference in 
the size range observed compared to that 
expected, with as much as a 66 base difference 
(Table 1). Four other loci displayed alleles 
outside their expected size ranges, but in each 
case this was only a difference of one repeat unit. 

Samples B73 and B74 displayed identical 
genotypes at all 7 loci, while no two other 
samples shared the same genotype at more than 3 
loci. These samples were collected in the vicinity 
of 2 whales migrating together. Samples B3 and 
B5 were the only other two samples (excluding 
B73 and B74) which had at least one allele 
corresponding at all 7 loci genotyped (Table 2). 


These samples were also collected from within a 
pod of two whales migrating together. 

Accompanying tail fluke photographs were 
obtained for 6 of the 12 skin samples, with each 
displaying a large variation in the degree of 
photo/camera angle, lighting and weather 
conditions. Photo-IDs wl"° matched alongside 
their respective microsatcllLe genotypes for 
comparative purposes (Fig. 1). 

DISCUSSION 

For over two decades attempts have been made 
to photographically identify individual 
humpback whales around the world. Recent 
advances in genetic techniques now provide a 
more informative form of individual identificat¬ 
ion. Microsatellite genotyping can provide 
information on contemporary population 
structure, gene flow, abundances, relatcdness and 
genetic diversity (e.g. McRae & Kovacs, 1994; 
Richard et al., 1996; Call et al., 1998; Palsboll et 
al., 1997b), enhancing the information available 
from photo-1 D research. 

In this study, 11 individuals were positively 
identified using a suite of 7 hypervariable micro¬ 
satellite loci. The level of variation detected 
could distinguish all but 2 of the 12 samples from 
as little as 1 locus (e.g. Fig. 1, GATA53), 
illustrating the accuracy with which this method 
can identify individual humpback whales. The 
two samples (B73 and B74) that could not be 
distinguished are most likely to be from one 
whale sampled twice. The specific probability of 
identity of the exact genotype shared by these two 
samples is 8.189 x 1CH 7 ; calculated from the 
frequency of alleles observed, which is an 
imprecise estimate of the frequency of alleles in 


TABLE 2. Genotypes of humpback whale sloughed skin samples for 7 microsatellite loci. 


Locus sample 

EV14 

EV21 

EV37 

EV96 

EV104 

GATA28 

GATA53 

A7 

128/130 

113/115 

192/194 

198/202 

143/143 

144/152 

232/244 

A8 

130/ 132 

109/109 

196/204 

196/202 

143/ 145 

144/144 

248/ 256 

B3 

130/ 132 

107/ 109 

206/218 

196/ 200 

143/145 

144/144 

232/ 260 

B5 

130/136 

109/109 

190/ 206 

200/202 

143/143 

144/148 

232/256 

B54 

130/ 132 

107/109 

202/ 208 

200/ 204 

141/145 

148/176 

236/248 

B73 

130/ 136 

113/115 

198/210 

196/206 

143/143 

152/ 180 

252/ 252 

B74 

130/136 

113/115 

198/210 

196/206 

143/ 143 

152/180 

252/ 252 

B78 

132/136 

119/119 

210/218 

194/202 

143/ 143 

- 

. 

B92 

132/132 

109/111 

190/192 

202/202 

143/145 

144/152 

256/ 264 

B97 

138/ 142 

107/115 

192/194 

196/196 

143/143 

144/144 

232/248 

C2 

130/130 

109/109 

194/ 206 

198/200 

143/143 

144/152 

248/ 252 

E10 

130/ 130 

107/115 

196/212 

204/210 

143/ 143 

144/ 144 

232/276 
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Locus EV21 EV14 EV104 GATA28 EV96 EV37 GATA53 

Genotype 109/109 130/132 143/145 144/144 196/202 196/204 248/256 
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Locus EV21 EV14 EV104 GATA28 EV96 EV37 GATA53 

Genotype 107/109 1 30/132 143/145 144/144 196/200 206/218 232/260 



Locus EV21 EV14 EV104 GATA28 EV96 EV37 GATA53 

Genotype 107/109 130/132 141/145 144/144 200/204 202/208 236/248 



Locus EV21 EV14 EV104 GATA28 EV96 EV37 GATA53 

Genotype 113/115 130/136 143/143 152/152 196/206 198/210 252/252 




cdL 


A. fti 


A 



Locus EV21 EV14 EV104 GATA28 EV37 EV96 GATA53 

Genotype 109/111 132/132 143/145 144/152 190/192 202/202 256/264 


i 




Locus EV21 EV14 EV104 GATA28 EV37 EV96 GATA53 

Genotype 107/115 138/142 143/143 144/ 144 192/194 196/196 232/248 


FIG 1 . Individual identifications of humpback whales migrating north past Byron Bay, using photographic 
identification and microsatellite genotyping. Genotypes are shown for 7 microsatellite loci. 


the population, and is therefore only an estimate 
of the exact probability of identity. Considering 
the extremely low probability of these two 
sample genotypes being identical by chance 


alone and the fact that they were obtained from 
the one pod of whales, it is likely that these two 
samples came from the same whale. 
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This result highlights a potential souree of error 
when using sloughed skin for microsatellite 
genotyping purposes. Skin can remain in the 
water eolumn for up to 20 minutes after being 
dislodged (Corkeron et al., 1999), therefore 
samples collected from pods containing several 
whales are less reliable for microsatellite geno¬ 
typing purposes. Valsecchi et al. (1998) found 
biopsy darting to be the most efficient method for 
matching samples to individuals, however this 
technique still cannot guarantee a mateh between 
a mierosatellite genotype and photo-ID. 

Samples B3 and B5 were found to share at least 
one allele in common at all 7 loci genotyped, 
suggesting that these two individuals may be 
related. Other studies have shown that associations 
between humpbaek whales tend to be non-related 
(except in the case of mother-ealf pairs) and 
transient, with few pairs being associated through 
time (Falcone et al., unpubl. data). Valsecchi et al. 
(in press) concluded that migrating humpbaek 
whales did not select their travelling companions 
based upon relatedness at any stage of the 
migration. Results of the present study, however, 
suggest that humpbaek whales may migrate as 
family units, as both individuals sampled were 
adults and not a mother-calf pair. Unfortunately, 
due to the small sample size and the limited 
number of loci genotyped, the inference of 
familial relationships based on allele frequencies 
is not strong. In order to definitively determine 
potential relationship between individuals, as 
many as 17 loci may need to be genotyped to 
minimise the ehance of random matehes 
(Palsboll, 1999). 

For data to be shared effectively between 
research groups there are potential technical 
errors that need to be addressed, ineluding: 
non-templated addition of a single adenine base 
by Taq DNA polymerase during PCR (Brown- 
stein et al., 1996; Magnuson et al., 1996); allelic 
dropout resulting from poor quality template 
DNA due to degraded or low quantity DNA 
(Jame & Lagoda, 1996); null or non-amplifying 
alleles (Brookfield, 1996; Jarne & Lagoda, 
1996); calibrating PCR product size scoring across 
hardware; and confirmation of amplification of 
the correct locus. 

Addition of an adenine base during PCR (+A) 
can cause problems in allele scoring during 
genotyping (Magnuson et al., 1996). The 
frequency of +A addition ean vary within and 
between loci, as well as within and between 
different gel runs, and can be affeeted by different 


DNA polymerases (Brownstein et al., 1996; 
Magnuson et al., 1996). Several procedures can 
be used to overcome this problem. 1) A reference 
sample or group of reference samples should be 
sequeneed for eaeh locus and always included in 
every PCR and gel. The correct product size can 
then be determined and correct binning 
boundaries set. 2) Alleles should be recorded as 
numbers of repeats rather than absolute PCR 
product size. 3) Different combinations of primer 
modification and DNA polymerase can be used to 
either induee 100% +A addition or reduce +A 
addition to 0%, so that results can be standardised 
aeeordingly. 

When using small quantities of poor quality or 
degraded DNA, often only one allele of a hetero¬ 
zygous individual is detected (Taberlet & Luikart, 
1999). This type of error, called allelic dropout, 
ereates an artificial exeess of homozygotes, 
possibly resulting in departures from Hardy- 
Weinberg equilibrium. A similar problem is the 
amplification of null alleles, which occurs when 
mismatches in the priming site of one allele cause 
the failure of that allele to be amplified, again 
eausing an excess of homozygotes. Allelic 
dropout and null alleles ean be differentiated as 
allelic dropout is associated with low quality 
DNA and therefore ean be detected across loei 
within an individual, whereas null alleles are 
associated with a specific locus and can be 
detected across individuals within a locus. 
Another potential problem associated with null 
alleles, is the use of primers designed for one 
species to amplify a homologous loeus in another 
speeies. In sueh instances more species specific 
primers my need to be designed. If reference 
samples for a locus have been sequeneed and 
aligned, conserved sequence bloeks ean be 
identified so that new primers ean be designed for 
those regions, reducing the risk of null alleles. By 
recording results as repeat numbers rather than 
PCR produet size, allele sizes can be directly 
compared between different primer pairs for the 
same locus. 

The use of different hardware for genotyping 
can result in identical samples being scored as 
different sizes. Calibration of hardware within 
and between research groups can be achieved by 
sequencing a reference individual or a group of 
reference individuals for each loeus, and always 
ineluding these reference samples in every PCR 
and gel. Furthermore, allele seoring can be 
standardised by recording data as repeat number 
rather than PCR product size. A number of loci in 
this study exhibited what seemed to be extensions 
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to their known size range, but these may have 
been the result of incorrect scoring or binning of 
genotypes, or non-calibration of hardware 
between laboratories. Hardware calibration and 
standardised definition of binning boundaries are 
therefore essential to eliminate potential scoring 
errors. 

When comparing results between research 
groups it is vital to ensure that the same locus has 
been amplified in all instances. Sequencing of a 
reference individual or group of reference 
individuals will establish whether or not the same 
locus is being amplified. While such an event 
may seem unlikely, it did occur in this study. For 
locus GATA53 we used the primers published in 
Palsboll et al. (1997a). Our results showed that 
the size range differed from the expected by 66 
bases. Investigation revealed that one of the 
published primers was unlikely to be the primer 
used in that study. Furthermore, when we 
compared the two primer pair sets on the same 
individuals, not only did the allele sizes differ, but 
the relative allele size ranges within individuals 
also differed. Despite the change in only one 
primer, it appeared that a different microsatellite 
locus had been amplified. The only method to test 
this hypothesis would be to sequence the products 
for both sets of primers. 

This study presents a model for the integration 
of microsatellite genotyping with photographic 
identification of humpback whales, using 
samples from the east coast of Australia as a case 
study. A standardised digital genetic database 
would greatly benefit research of humpback 
whale populations through sharing of results 
worldwide among research groups and be of 
immense value for conservation and management 
purposes. Integration of such a database with 
current photo-ID databases would enhance the 
value of each by incorporating the accuracy of 
microsatellitc genotyping with the wealth of 
photo-ID data available. 
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